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Abstract 

Context-dependent nature of biological phenomena are well documented 
in every branch of biology. While there have been few previous attempts 
to (implicitly) model various facets of biological context-dependence, a 
formal and general mathematical construct to model the wide spectrum 
of context-dependence, eludes the students of biology. An objective and 
rigorous model, from 'bottom-up' and 'top-down' perspective is proposed 
here to describe the various kinds of context-dependence that we encounter 
in different branches of biology. Care has been taken at every step to 
match the assumptions with experimentally obtained data, so that the bi- 
ological relevance of this model is not compromised to maintain its math- 
ematical rigor. Rules of interactions and dependencies between biological 
contexts have been proposed too. It is found that a hierarchic nature of 
dependence amongst the biological contexts models the biological nature 
best. 
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'Context-dependence' is omnipresent in Biology. From the realm of substitu- 
tion of nucleotides[l,2] to the paradigm of protein structure-function[3], from 
the sphere of cellular dynamics[4] to that in evolutionary dynamics[5], one en- 
counters events and processes that are 'context-dependent' in copious amount. 
While various attempts have been made from differing perspectives to some- 
how quantify context-sensitiveness of particular biological events [6], a gen- 
eral mathematical framework that attempts to capture and describe ubiquitous 
'context-dependence' eludes the students of Biology. In the present work, such 
a mathematical structure is proposed that attempts to model the ubiquitous 
'context-dependence' from bottom-up as well as top-down perspectives. 

We suggest the triad of the form '< Structure of biological goals (F), Biological 
contexts (C), Physical structure of the system (P)>, (< F,C,P >), to describe 
the structure of any biological process. Components of this triad are by no 
means independent; F determines the suitable choice of C; while C, in its turn, 
operates upon some particular subset of P with definite features. To ensure F is 
achieved, C engages only certain elements of P. It is important to mention that 
same physical structures can be subjected to different contexts to achieve differ- 
ent biological goals; for example, same proteins under different set of contexts 
(the state of the biological system and protein localization) may be involved in 
different biological processes, so that the different goals of these processes are 
achieved. [7]. It assumes importance to note here that the present approach 
is different both in its motivation and mathematical formalism from an earlier 



2 



work [8] that had attempted to model the network of biological interactions. 
Our work attempts to formulate the general and formal principles of contextual 
interactions by modeling the nature the dependencies between biological con- 
texts, which operate upon physical (structural) parameters, to ensure that any 
particular biological goal is achieved. 

Since the motivation of any biological process is solely to accomplish the neces- 
sary biological goals and the structure of biological goals is hierarchic [9,10]; we 
propose a hierarchic organizational structure for biological contexts too. Thus, 
structure of C will assume that of a tree (namely the 'Context-Tree' (CT)), where 
the root-vertex will denote the global context whose achievement is caused by 
the underlying contexts. The lowest level of CT will be occupied by basic, ele- 
mentary contexts (- for example, the genes themselves, where a systematic com- 
bination of the molecular function of their products (proteins) is studied with re- 
spect to achieving a specific biological goal[9]), and their set A — {ai, 012, . . . , a n } 
will constitute the base of CT. This form of structure for S, lends itself well for 
analyzing the measure of performance of any context to actualize any particu- 
lar F, in the form Every context, including the global context, may be 
obtained as a composition of these basic contextual elements of CT. However, 
to accomplish a definite set of biological goals, the contexts describing various 
facets of a biological system need to interact between themselves. We can define 
a set of rules U (U = {a±, az, . . . , a n }) that governs the framework for compo- 
sition between the base elements of CT, denoted by a. We note here that set U 
might be infinite, and the compositions ciiacij with a£[/ need not necessarily 
be defined for all a*, aj € A. 



3 



The importance of set U is paramount, because it is this set that governs how the 
various elements of S will interact to construct the CT and eventually achieve 
F. Let us assign to each element a of the set U, an n x n incidence matrix 
[11] A a with entries ag- is unity if the composition a^aaj is defined, or zero 
otherwise. We can then introduce a matrix A u whose elements are given as : 



stating that — 1 if there exists a valid composition rule between a% and aj , 
taken in order. 

Matrix A u conforms to constraints of biological reality perfectly because it sug- 
gests that for certain magnitudes of i, the i th row and i th column of the matrix 
can be all zeros. These cases imply that interactions between certain basic 
elements of context-set A — {oi, az, . . . , a n } are not allowable biologically. We 
demonstrate such non-allowable biological contexts with two examples : 
Example- 1) When A phage (a virus that infects the bacteria Escherichia coli) 
encounters a bacterium, it attaches itself only to certain particular receptors 
with specific structural features, on the bacterial membrane. That is, the rel- 
evant biological contexts ensure that binding of A phage to various other can- 
didate receptor sites with slightly varying structural aspects, is not allowed. 
Subsequently, when the virus genome enters the bacterium, only two pathways 
(out of theoretically infinite number of pathways) of alternative nature, namely 
the 'lytic pathway' or the 'lysogenic pathway' are allowed biologically [8]. Again, 
it is the pertinent contexts which forbids the existence of any other pathways, 
although a theoretical thermodynamic study of the situation can suggest many 
possible pathways with (almost) similar efficiencies. 




(1) 
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Example-2) Out of the entire spectrum of possible mRNAs that can be gen- 
erated from a single gene, only one or a few are created at a time; biological 
contexts do not allow the other possibilities to come into being, although these 
theoretically possible contexts would have operated upon the same structural 
parameters. The set of allowable contexts to achieve this particular goal form 
the so-called 'regulatory mechanism' that includes an extremely sensitive bal- 
ance between concentration magnitudes of pertinent entities, the destination, 
sequence variety, structural diversity and finally the functional options of the 
resulting protein related to the type of tissue or the stage of development, etc.. 
[12]. 

Henceforth, there are no corresponding rules to describe the biological impos- 
sibilities in U. In other words, these compositions cannot participate in the 
construction of the tree CT. Removing these aforementioned entries, from the 
set A we obtain a non-redundant set of basic rules of context composition. From 
here onwards, we will denote this non-redundant set by set A. 

We can now attempt to describe a biological process of transportation. We 
assume three elements {01,02,03} from the set of basic contexts A. We de- 
note a\ as the context for loading, ai for delivery and 03 for unloading. To 
provide an example, we consider the case when axon of a neuron transports 
the transmission of action potentials from the cell body to the synapse. The 
complicated mechanism of action potential propagation and their active trans- 
portation from their site of synthesis in the cell body through the axoplasm 
to intracellular target sites in the axon and synapse, - provides an ideal case 
to demonstrate the various context-specific activities. Here The loading and 
delivery contexts [a\ and a-i respectively) involves the series synchronized cou- 
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plings between allowed set of specific membranous organelles, synaptic vesicle 
precursors, signaling molecules, growth factors, protein complexes, cytoskeletal 
components, sodium and potassium channels and many other biological compo- 
nents. On the other hand, the unloading context (03) describes the neurotrophic 
signals that are transported back from the synapse to the cell body, keeping an 
account of efficiency and reliability of loading and delivery operations[13]. Here 
we introduce the set of compositional rules U . U may then operate upon these 
three basic elements to ensure a sequential execution of elementary processes to 
ensure that final goal (F) is achieved. In this case the matrix A u can be written 
as : 





ai 


«2 


a 3 


fll 











a-i 


1 








03 





1 






Hence the elementary biological context, a, is defined for only two cases from the 
entire spectrum of possible contextual interactions; they are a^aai (implying 
the existence of a pipeline where context for delivery comes into action after the 
context for loading is ensured) and a^aa^ (implying the existence of context for 
unloading after the operation of context for delivery is performed) . It is interest- 
ing to note that a pipeline with a^aai is not biologically relevant, since it implies 
the existence of context for unloading after the context for loading without any 
delivery action. Similarly contextual relations like a\aa\ (loading certain con- 
texts repeatedly without a purpose), a\ctai (existence of the context to ensure 
delivery when the initiation of the process is not ensured), a^cxaz (context to 
ensure unloading when the delivery process is incomplete) and other spurious 
context-relations like aictai, arenas are assigned a magnitude zero because they 
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are biological impertinent. On a global scale, the composition 0,30:0,20(0,1 cap- 
tures the purposeful nature of biological goals. 

The findings of the last paragraph implies that interactions between biologi- 
cal contexts, are transitive but are not commutative (since 030:02 and 02001 are 
defined, 03002001 can be defined; but merely because 03002 exists doesn't im- 
ply that 02003 exists too). The non-commutative nature of biological contexts 
can be understood better from the general treatment of the problem elaborated 
later. 

The (bottom-up) paradigm of description of interplay of biological contexts can 
be generalized by describing the entire biological universe with the Thread-Mesh 
(TM) model[14]. The TM model segments the biological space-time into a series 
of different biological organizations, viz. the nucleotides; amino acids; macro- 
molecules (proteins, sugar polymers, glycoproteins); biochemical networks; bio- 
logical cell; tissue; organs; organisms; society and ecosystem; where these orga- 
nizational schemes are called threshold levels. Emergence of a single biological 
property (compositional or functional) creates a new biological threshold level 
in the TM model. Thus, if any arbitrarily chosen i th biological threshold level 
is denoted as THi, THj^i will be containing at least one biological property 
that TH; didn't possess. Somewhat similar schemes of identification of biolog- 
ical threshold levels is neither new [15,16] nor unique and there can be many 
other intermediary threshold levels too (considering the fact that existence of 
one emergent thread distinguishes TH;_|_i from TH;). For example, at high 
resolution one can consider the secondary, tertiary, quaternary structures too 
as separate threshold levels that exist between amino acids and the proteins. 
The basic principles for subsequent discourse, however, are general and can be 
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applied to any threshold level. Every possible property that a threshold level 
is endowed with, is represented by a 'thread' in the TM model. Thus an envi- 
ronmental property capable of influencing biological action will be called as an 
'environmental thread' in the present parlance. Threads can be compositional, 
structural or functional. For example, for the biological threshold level corre- 
sponding to the enzymes (threshold level representing the macromolecules) , one 
of the compositional threads is the amino acid sequence; whereas the radius of 
gyration, the resultant backbone dipole moment and each of the bond lengths, 
bond angles, torsion angles are some examples of structural threads and the 
values for K m , V max , K cat are some examples of it's functional threads. 

To describe the 'Context-Tree'(CT) in such hierarchic paradigm in a general 
manner, we introduce a construct C, which is a family of embedded partitions 
of contexts C =< C 1 , C 2 , . . . , C r > that operate upon any relevant subset of 
structural threads (J) representing the physical structure (P) of any arbitrarily 
chosen threshold level S. J C P and J = {1,2, ... ,m}. For example, it has 
been found [3] that at the threshold level of proteins; (S=Threshold Proteins) 
in an urea- induced media (the 'environmental thread' influencing J), the extent 
of stability of mutant proteins are highly dependent on the contexts (C) which 
operate upon the various structural parameters (J), that form a subset of (P) 
describing (S). 

We can describe the situation as : 
C7 S =< Cf , Ci, ...,Cf> U< =1 Cf = J, CfnC? = {%+ j) , S = T7T (3) 
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The embedding refers to any element of the partition of the s biological thresh- 
old level; i.e., the set C? represents the union of several sets C? _1 , C? _1 , . . . , Cf~ 
of the (S — l) th biological threshold level. Such description of (CT) conforms 
to a previous study on similar topic [6]. Findings from a recent study[17] vindi- 
cates Cf DCj = 0. To elaborate the hierarchic structure, we can write C < — >< 

c i _1 > C 2~ l > C i _1 >> if cS = u i=i c i _1 - Since the entire set of interactions 
between various contexts is ultimately to satisfy biological goals and since the 
nature of organization of biological goals can also be assumed to be hierarchic, 
we attempt to describe it by denning I (C s ) = I and C' root —< {1, 2, . . . , m} > 
; i.e., the partition at the highest (root) level consists of one set, namely J. 

We can associate each element Cj (s = 2, r) of the partition to the context- 
interaction function, namely (cti, a 2 , ■ ■ ■ , o^c" 5 )) , where a G {0,-1, +1}, 
conforming to the previously defined U (U — {ct\, a 2 , ■ ■ ■ , a„}). We associate 
each element Cj of the first level to a binary relation Rj (the previously denned 
elementary contexts are related by this, say aRb, where A = {a, b, ... , z}) on 
the biological sub-space ECj. 

These concepts can formally be described as : 

Let Cj < — >< C\, . . . , Cj > and define a relation on E C 2 using the formula 
(for I > 1) : 

a c] Rlc] <s==> fj ("i ' "2 , • • • , m ) = 1 (4) 
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oil = +1 if a c iR t b c i 
on = — 1 if a c iRb c i 
Oii = if a c i = b c i 

In case of I = 1, i?| = If all the relations of the (5 — l) th biological 

threshold level are defined, then the relations Rj of the S th threshold level with 
(I > 1) can be defined by the following construct : 

If Cf < — >< Cf-\ C't 1 , Cf-\ > , then 

a cf R bcf // (a 1 ,a 2 ,...,ai) = l (5) 

aj = +1 if a (J s-iRf~ 1 b c s-i 
on = —1 if a c s-iR s ^ 1 b c s-i 
a.i = i/ a c s-i — b c s-i 

If C s = C 5_1 , then R s = i? s ~ 1 ; in other words the construction requires the 
relation R to coincide with R r , which is a single relation at the r th upper level. 

To describe the entire bottom-up paradigm of description of interaction scheme 
between biological contexts, we consider an example where we describe the con- 
textual constraints on the active site of an enzyme in simplistic terms. Without 
any loss of generality, we consider the the threshold level representing proteins 
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to be the root level in this case. The goal of the system (F) for the present 
purpose is to make the enzyme functional. We assume the elementary contexts 
that can influence functionality of the enzyme active site to be represented with 
3 basic partitions; namely, first, the contextual differences originating out of 
internal coordinates; second, contextual differences arising out of interaction 
profile of the active site atoms with water; and third, contextual differences 
arising out of the capability of the active site to undergo a shape change. Hence 
we describe the family of partitions C, as C 1 =< {1,2,3}, {4,5}, {6} > and 
C 2 =< J >; where element 1 denotes (possible) contextual difference arising 
out of the fluctuation of bond lengths, element 2 denotes (possible) contex- 
tual difference arising out of the fluctuation of bond angles, element 3 denotes 
(possible) contextual difference arising out of the fluctuation of torsion angles. 
Similarly, element 4 stands for (possible) contextual difference arising out of the 
hydrophobicity of active site patch, element 5 denotes the (possible) contextual 
difference arising out of the local electrostatic profile of the active site patch. 
Element 6 denotes the extent of (possible) contextual difference arising out of 
the change in the local shape of the active site patch. Denoting the set {1, 2, 3} 
as D\, {4,5} as D 2 and {6} as D§, the hierarchic nature of these contextual 
dependencies can easily be described as : 
D 2 RD l <^=> [D 2 > £>i] 
similarly, 

D 3 RD 2 <^=> [D 3 > D 2 ] and D z RD l <^=> [D 3 > D x ] 

Implying that a (possible) contextual difference arising out of the hydrophobicity 
of active site patch, or a (possible) contextual difference due to the local electro- 
static profile of the active site patch will surely account for some change in the 
distribution profile of bond length, bond angle and torsion angle distribution. 
But the vice- verse of the case described, viz., a change in the local electrostatics 
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profile and local hydrophobic profile due to a change in bond-length, bond-angle 
or torsion angle might or might not be observed in reality. Similarly, in case of a 
possible change in local shape the local electrostatic profile, local hydrophobicity 
profile, local distribution of bond-length, bond-angle, torsion angle will surely 
be taking place; but the other way round might or might not be observed. This 
vindicates and generalizes our previous finding that interactions between bio- 
logical contexts, are transitive but are not commutative (hence, if D 2 RD\ and 
D 3 RD 2 are defined, D 3 RD 2 RD 1 can be defined; but merely because D 3 RD 2 
exists doesn't imply that D 2 RD% exists too). 

While framework of eq n s [1 — 5] along with examples 1 and 2 describe the 
nature of multilevel organization of CT, such description is 'bottom-up' in na- 
ture. Hence, while it is helpful to describe the context-mapping between any two 
particular adjacent biological threshold levels levels 'I' and 'l + l', (say between 
threshold levels representing nucleotides and amino acids, amino acids and pro- 
teins, or between proteins and biochemical pathways, etc ..) the general mode 
of dependency within CT with a birds-eye ('top-down') view of the organization 
of it, can hardly be guessed from such bottom-up approach. 

We start the construction of the top-down scheme of description of dependen- 
cies between biological contexts, by enlisting the assumptions involved therein. 
Hence : 

Assumption- 1 ) : In absence of random external disturbances and without a 
failure of any component belonging to physical structure of the system (P) all 
the rules of multilevel interactions between the contexts representing any biolog- 
ical threshold level, can be constructed and described in deterministic manner. 



12 



Success of recent attempts with deterministic modeling of various biological phe- 
nomena from diverse backgrounds [18-20] suggest that such assumption is not 
ill-founded, and that too in absence of possible perturbations. 
Assumption-2 ) : The necessary and sufficient condition for the feasibility of 
these deterministic rules of inter- level context interactions, is whether they ac- 
count for the accomplishment of certain biological goals (F) or not. Previous 
studies [8-10] vindicate such assumption. 

Assumption-3 ) : Although biological systems will be exposed to randomly 
varying magnitudes of external parameters, the essence of the deterministic 
criteria of context interactions to accomplish the required biological function 
will not be perturbed by significant amount. This assumption implies that de- 
terministic manner of context interactions will not be undergoing significant 
change when the magnitudes of components of underlying physical structures 
{Pi} (Pi G P), comprised of relevant biological parameters, are altered within 
some allowable range. We describe this allowable range of assumed magnitude 
of some arbitrarily chosen parameter w by an interval [wo, w±] . 
Relevance of this assumption comes into light when one analyzes the nature of 
some previous models. Since every biological property operates within a speci- 
fied bound of magnitude, something that has been referred to as 'fluctuation' in 
an earlier study [15] the functions that represent them will also be bounded in 
their ranges. Examples for such fluctuation are many; in the biological threshold 
level representing cells, for the mitogen-activated protein kinase cascade studies, 
the total concentrations of MKKK, MKK and MAPK have been found to be 
in the range 10-1000 nm and the estimates for the kcat values of the protein 
kinases and phosphatases have been found to range from 0.01 to 1 s" 1 [21]. Sim- 
ilarly, for the proteins, the mass fractal dimension representing compactness of 
the protein has been found to be in the range of 2.22 to 2.69 [22]. 
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Based on these assumptions, we propose the functional that defines the probabil- 
ity of attaining the biological goal (F) under consideration, to assume the form : 

Fq = cj> (xi, x 2 , ■ ■ ■ , x n ) dxi (1 < i < n) (6) 
Jxtes 

where <fi ls the probability density of attaining the objective (biological goal) 
and X is the feasibility domain of the contexts x%. 



Since to achieve every biological goal, many (say, m) successive stages of context 
interactions are required, we can express eq n — 6 at a higher resolution as : 

m 

<f>(x 1 ,x 2 , ■■■,x n ) = Y[ <t>j\4>j-i {xi,x 2 , ...,x n ) (7) 

J=l 

where 4>j\(j)j-\ represent the conditional probability associated with context {x^ 
interactions, while attempting to achieve a particular biological goal. 

However, since the individual physical parameters p, (pi G P), upon which the 
contexts are working, may not always be strongly correlated and although re- 
lated with each other, can be considered independent when viewed individually 
with respect to their functional contribution to the system. For example, the 
time-dependent and context-dependent fluctuations in individual bond- lengths, 
bond-angles and torsion-angles in the protein interior, although might be re- 
lated in some intricate way to the resultant dipole moment for the protein; can 
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be considered, for all practical purposes, in terms of their individual (and not 
linked) contributions in ensuring proteins stability and functionality. Hence we 
attempt to partition the relevant contexts into a sum of disjoint domains; such 
that : 

( Xi EX,) and EiXi = S. 

Considering this partition we can re-write eq n — 6 as : 

F = / .. / 4>l(xi,X 2 ,..,X n ) X (j) 2 \l(xi,X2,..,X n )..(t) m \ m -i(xi,X2,..,X n )dxldX2..dXm (8) 

In other words, purely in terms of achievement of biological goals : 

F Q = F 1 F 2 \ 1 . . . F m \ m _ x (9) 



where 



Fj\j-i = / / ... / (f)j\j-i(xi,X2,...,X n )dxldX2...dXn (10) 

Jx 1 ex 1 Jx 2 ex 2 Jx n ex n 

are the conditional probabilities of context-interactions of the system realizing 
the successive stages of the task. It is necessary to mention here that to achieve 
any biological function, the domain of integration for every Xi in eq n — 8 must 
be within their respective permissible range, say [x no , x ni ]. 

While it is difficult to assume that every context-interaction necessary to re- 
alize certain biological goal will always be operating in deterministic manner 
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with perfect efficiency, experience teaches us that biological goals are seldom 
compromised with. Hence we assume that the reliability of any arbitrarily cho- 
sen context interaction at any arbitrarily chosen j th state in the realization of 
certain biological function, is statistically independent of the probability of the 
realization of that particular biological function. In that case, the integrand in 
eq n — 10, can be expressed as a product 4>j\j-i {xi-,%2, ■ ■ ■ ,x n ) rj (xi,X2, . . . ,x n ), 
where rj,(rj S R) describes the probability of reliability of any arbitrarily chosen 
context-interaction at j th state in the realization of certain biological function. 

Hence eq n — 7, can be expressed more realistically as : 

m 

<t>(xi,X2, ■ ■ -,x n ) = \\ <Pj\j-l (xi,x 2 , ■ ■ -,x n )rj (x 1 ,x 2 , ...,x n ) (11) 



Thus, when the reliability of context-interactions are taken into account, assum- 
ing that eq n — 8 and eq n — 10 are valid, eq n — 6 and eq n — 7 can be re-written 
as : 

m 

Fo = ]Jf j \F^ 1 R j (12) 

where 

Rj = / ••• / rj (x\,X2, ■ ■ ■ , x n ) dx\dx2 ■ ■ ■ dx n (13) 

Jx 2 £X 2 Jx n eX n 
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In other words, eg" — 12, can be re-written as 



Fq — FiF 2 \i . . . F n \ n _iR-iR 2 . . . R m (14) 



Conclusion : 

While eg" — 1 to eg™ — 5 constructed the bottom-up scheme of describing the 
interactions and dependencies between biological contexts, the framework of 
equations eg" — 6 to eg" — 14 describe the top-down view of the same. Together 
these set of equations present a comprehensive way to quantitatively model the 
omnipresent "context-dependence" in biology. Evidences for the reliability of 
such mathematical treatise can easily be obtained from the various experimen- 
tally proved results that are provided to emphasize the reasonable nature of 
these formulations. Since contemporary biology, as never before, is attempting 
to be objective in its philosophy, the necessity of a mathematical model to de- 
scribe the "context-dependent" nature of it can hardly be ignored. The model 
proposed here, therefore, assumes immense importance. 
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